A Novel and Automatic Circular Contour 

Approximation Based Breast Tumor 

Classification Algorithm 



M.S.Abdaheer* 1 , EkramKhan* 2 

Aligarh Muslim University, 

Aligarh, India 

sss.viji@gmail.com,sss_viji@yahoo.com 



Abstract — In this paper a novel and automatic method for 
classification of breast malignancy in digital mammogram 
images is proposed. It is based on two features named as 
Normalized mean radial distance as F] and number of 
crossing point between fitting circle and tumor for the circle 
as F 2 . The roughness of the breast tumor out-layer is 
measured in terms of normalized mean radial distance. The 
fitting circle is obtained by considering the centroid of tumor 
as its centre and arithmetic mean of maximum and 
minimum radial distances of contour points from the 
centroid, as its radius. The similarity between tumor and its 
circle is measured in terms of number of crossing points 
between tumor contour and circle for the tumor contour. 
The simulation results show that for a set of 150 tumor 
contours, the proposed method gives 92.67% accuracy for Fl 
and 94.667% accuracy for F2. The performance obtained in 
terms of the receiver operating characteristic (ROC) 
parameters like accuracy (A c ), sensitivity (S e ), specificity 
(S p ), and positive (PPV) and negative predictive values (NPV) 
for Fl and F2 

as(0.92667,0.9241,0.9296,0.9359,0.9167),(0.94667,0.9873,0.90 

14,0.9176,0.9846) respectively. 

Keywords — Breast tumor, Benign, Malignant, Circular 
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I. Introduction 
Breast cancer is the common diseases among 
women who have crossed 50 and above. It 
pushes the cervical cancer in to second spot. The 
death mortality rate is increased every year as 
per the survey report displayed by the American 
cancer society [1]. These surveys are counting 
only on metros but not count in to the rural areas. 
Early detection and characterization of breast 
cancer can apparently improves the situation and 
possibly reduce the chances of false biopsies. 
Many methods like Digital Mammography, MRI 
and Ultrasound scanning are commonly used for 
early detection of breast cancers [3]. However, 
the failure to detect any abnormal lesion at an 
early stage may lead to disastrous consequences. 
The mammographic images show signs of 
obstruction and many direct and indirect 
radiographic signs due to space occupying 



lesions in the tissue region of breast. The 
interpretation of poorly illuminated 
mammography is very difficult and physicians 
with different levels of experience can come up 
with different results for the same breast 
mammography images. To minimize the 
operator related deficiency many image 
processing based algorithms are developed for 
assisting the radiologist to analyze the breast 
mammography. Through this image processing 
based technique, features for classification are 
extracted from the input image [4]. With the 
help of classifier, tumors are classified as benign 
or malignant based on their features values and 
its decision threshold. 

S. Baeg et al classified the breast 
abnormalities based on textural features named 
as denseness and architectural distortion and 
obtained satisfactory performance [5]. 
Rangayyan et al. used acutance as feature for 
classification, achieving a classification rate of 
92.6% for manually traced 54 tumor contours 
[4]. El-Faramawy used various shape features 
like compactness, Fourier descriptor, moments 
and chord-length statics for classifications. 
However, combination of all these shape 
features could achieve only 76% classification 
efficiency[6]. Menut et al. classified the breast 
masses with the help of parabolic modeling 
method with the accuracy of 76% [7]. Arun 
Thaitaikumar et al. reduced the 56.3% of 
biopsies by using axial shear estrography as a 
classifier for classification of breast tumors [8] . 
S.C.Yanga et al. detected and classified the 
breast masses with the help of five texture and 
four shape features as inputs of Parabolic neural 
network classifier and achieved 84.15% 
classification rate with overall area under ROC 
of 0.93 [9]. 
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H. W. Lee et al. analyzed the infiltrative 
nature of breast masses by octave energy 
derived from reversible round off non recursive 
1-D Discrete periodic wavelet transform. A test 
dataset of breast sonograms with the lesion 
contour delineated by an experienced physician 
and three datasets of breast sonograms with the 
lesion contour delineated by a Java-based image 
processing program, ImageJ, are built for feature 
efficacy evaluation. They achieved 95.1% and 
84.4% of accuracy for manual and Image J 
generated data base by combining octave energy 
feature with morphometric feature [10]. Bi Lio 
et al developed local texture based fully 
automated breast tumor classification system 
with the accuracy of 93.75% [11]. T. M. 
Nugayan et al. achieved 94.6% of classification 
accuracy with the help of fractal dimensions 
based classifications method. Moreover, 
mammographic mass boundaries are usually 
blurry, making it difficult for automated 
detection schemes to detect precisely the mass 
boundaries. For these reasons, features 
quantifying the textural information contained 
by masses are needed to perform classification 
independent of shape [12]. 

In the present study, we focus on the 
development of texture measures for the 
classification of mammographic masses as 
benign or malignant. The rest of the paper is 
organized as follows. Section 2 describes about 
materials and methods. Section 3 describes 
about result and discussion. Section 4 will 
finally conclude the paper. 

II. MATERIALS AND METHODS 

The necessary data set for testing and 
implementation of our algorithm is taken from 
MIAS data base [2], R.M.Rangayyan data base 
[4] as well as some of the contour from our local 
hand drawn data base. The given input 
mammogram image is enhanced with the help of 
morphological opening filter with disk 
structuring element of order 5. The enhanced 
image is segmented through intensity histogram 
based optimum threshold technique [13]. The 
ROI (region in which tumor is present) is 
automatically traced out from segmented 
foreground image. The edges of the 
automatically traced ROI are calculated through 
canny edge detector . The artifacts or high 
frequency noises which are present over on the 
extracted edges are removed through pre 



processing technique. In this work, 
morphological opening filter with disk 
structuring element of order 3 is used as a pre 
processing filter to improve the visual 
impactness of extracted contour image 
(extracted edges) [14]. The improvement of 
visual impactness is measured in terms of peek 
signal to noise ratio. The pre processed contour 
act as an input image of the proposed breast 
tumor classification algorithm as shown in Fig.l. 
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Fig. 1, Contour Extraction from Digital Mammogram Images 

The main objective of this algorithm is to 
draw a circle for the given input tumor. By 
keeping centroid of the tumor as the center of 
the circle, circle is fitted on the given tumor. 
The radius (R) of the circle is obtained as an 
arithmetic mean of minimum and maximum 
radial distance which is measured from centroid 
of the tumor to coordinate of the contour. The 
normalized mean radial distance value depends 
up on the roughness of the counter. By keeping 
centroid of the tumor as a center of the contour, 
the circle is fitted on the tumor. The fitted circle 
may cross and merge with different points of the 
given tumor. The number of crossed point 
between fitted circle and tumor for the circle is 
depends on the roughness of the out layer 
(contour) of the tumor. The procedure to draw 
the circle and determination of its parameters 
are discussed in the following sub sections. 

A. Centroid Calculation 
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For a given contour, its centroid is calculated 
by approximating contour into polygon with N 
vertices. The coordinates (x;, yi) of each pixel of 
the contour (C) act as vertices of image [15]. If 
vertices of N points are known then the 
coordinate of centroid is calculated as follows; 



JV 

2> ! ' 



Yc 



N 



(1) 



whereas rough structured and complex natured 
malignant tumor likely to have less value of 
normalized radial distance. Therefore 
normalized mean radial distance used as a 
feature for classification of breast tumors. 

The normalized mean radial distance (N mr ) 
of the given tumor is defined as follows. 

JV.,=-nr(x,y) (3) 



ivtr 



y=i 



where 



N is the total number of pixels in a given 
contour image mi is the weighting factor 
(Number of white pixels in row containing the 
vertex) x c and y c are the coordinates of the 
centroid. 

B. Converting Radial Distance as the Radius of the circle 

Once the centroid (O) of the tumor is obtained, 
it is used as the centre of the best fitting circle. 
The next step is to find the radius of the best 
fitting circle, which is the arithmetic mean of 
maximum and minimum radial distance between 
the centroid and the contour as shown in the 
Fig.2. The radial distance, r(x, y), between i 1 
contour point (xi, yi) and centroid (x c , y c ) can 
be measured as follows: 



In the proposed algorithm, the normalized 
mean radial distance defined in Eqn. (3) is used 
as feature for the classification of tumors in 
either benign class or malignant class, by 
comparing the feature value Fi= iV mr with a pre- 
determined threshold (Ti) (to be discussed in 
the next subsection). As stated earlier that 
benign tumors have smooth, round or oval 
shaped contour, and thereby larger value of Fi, 
whereas the malignant tumors having rough and 
ill structured contour surface will have small 
value of Fi. Therefore, for a given threshold (Ti) 
and a contour with feature value Fi, the 
classifier works as follows; 



ifF.KT, 
else - 



malignant tumor; 
benign tumor 



(4) 
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The arithmetic mean of minimum and 
maximum radial distance defined in Eqn (2) 
used as a radius of the best fitting circle. 



Then the radius of best fitting circle is 
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(b) 



Fig.2 Radial distances measurement of (a). Benign tumor 
and (b). Malignant tumor 

As evident fromFig.2, that measured radial 
distance value depends upon the roughness of 
tumor contour. The smooth and round shaped 
contours of the benign tumors likely to have 
larger value of normalized mean radial distance, 



By keeping centroid r of a given tumor as a 
center of circle and radjus calculated from Eqn. 
(5), the circles for benign and malignant tumors 
are shown in Fig. 2. 
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(a) 



(b) 



if p(x, j/) = O then 

pO + 1, j/) = O; pCx - 1, 3/) = O 



Fig 3. The Best Fitting Circles for contours of (a) Benign Tumor 
and (b) Malignant Tumor 



C .Calculating Number of Crossing Points (N c ) 

As evident from Fig. 3 that the overlapping 
area between a given contour and corresponding 
fitting curve depends on the roughness of the 
contour. The smoother contours of benign 
tumors are likely to have less number of 
crossing points with the fitting circle, whereas 
the contours with rough outlier of malignant 
tumors are likely to have more number of 
crossing points with the circle. Therefore, the 
number of crossing points between circle and 
tumor for the circle is used as a feature for 
classification of breast tumors. 

For exactly count the number of crossed 

(N N N N N ) 
points ^ L ' 2 ' 3 'Vi.^iJ , number of 

(N ) 
merged points v m> are calculated and 

subtracted from number crossed and merged 
points^ cm ' . 

(N ) 
Number of Crossed Points c 

(N X ,N 2 ,N 3 N n _ u N n ) . 

l<n<N-l, ViV eC(x,y)nC/r(>,y) 



(6) 
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where 



~- Number of crossed and merged points 
Number of merged points 



Numbers of crossed and merged points are 
calculated as follows 

l.The given input image I(x, y) is scanned 
from top to bottom and left to right. 

2. If pixel p of given input image I(x, y) is one, 
then it will pass in to next pixel. 

3. If pixel p of given input image I(x, y) and its 
four neighborhood pixel value is equal to zero 
then it will check its radial distance 



p(x, y~) = 



p(x, y - 1) = O; p(x, y + 1) 
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4. The radial distance of corresponding pixel is not 
equal to radius of fitted circle Ii (x, y), then it 
will be counted as a number merged points. 
if Pi(x,y)up(x,y) 
count = 
then 



7(x + l,y,r(x-l,y) ] 
rO,y-l),r(x,y + l)J 



\r(x + Xy),r(x-l,y)\ 

lr(x,y-l),r(x,y + l)j 



Count = Count + 1 = JV„ 



where 
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(8) 



(9) 



(10) 



P i is the pixel of the fitted circle 
P is the pixel of given input breast tumor contour 
x & y is the coordinates of the pixel. 
Total number of pixel whose values is zero also 
counted as total number of merged and crossed points 
N 

5. Total number of crossed point is calculated 
by subtracting total number of merged points 
from total number of points. 



i\L =N„ 



N. 



(11) 



In the proposed algorithm, number of crossed 
points (N c ) defined in Eqn (11) is used as 
feature for the classification of tumors in either 
benign class or malignant class, by comparing 
the feature value F 2 = N c with a pre-determined 

threshold (T2) (to be discussed in the next 
subsection). As stated earlier that benign tumors 
have smooth contour, and thereby smaller value 
of F2, whereas the malignant tumors having 
rough contour surface will have large value of 
F2. Therefore, for a given threshold (T) and a 
contour with feature value F, the classifier 
works as follows; 
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ifF 2 >T 2 
else = 



malignant tumor; 
benign tumor 



(12) 



D. Decision Threshold 

It is evident from Eqn (4) and Eqn.(12) that for 
classification, a threshold value (T) is to be 
calculated for classification. For a given training 
set of extracted contours of tumors (with known 
class and feature value F), the threshold value (T) 
is calculated for Feature Fi and F2 as follows. 
The decision threshold for Fi is calculated as 
arithmetic mean of maximum value of Fi in 
malignant class and minimum value of Fi in 
benign class, which is as follows 



classifiers are calculated 
efficiency of classifiers. 
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(17) 
(18) 

(19) 



T = 



P + F 

^lM.max ~ r r lB,mm 



(13) 



where, 

Fm, max is the maximum feature value of 
malignant class with in training data set. 

Fb, min is the minimum feature value of benign 
class with in training data set 

The decision level for F2 is calculated as 
arithmetic mean of maximum value of F2 in 
benign class of training set and minimum value 
of F in the malignant class, which is as follows 



T = 



F +F 

2B,max 2M,min 



(14) 



where, 

Fb, max is the maximum feature value of benign 
class with in training data set. 

Fm, min is the minimum feature value of 
malignant class with in training data set 

Based on this decision level, tumors (training 
and validation sets) are classified according to 
Eqn (4). and Eqn.(12). The performance of the 
classifier is evaluated through Receiver 
Operating Characteristics (ROC). In ROC, if 
malignant tumor is classified as malignant, it is 
called as a true positive case (TP). If malignant 
is classified as benign or benign is classified as 
malignant then it is called as a false negative 
(FN) and false positive (FP) respectively. If 
benign class is predicted as a benign then it's 
called as true negative (TN). Accuracy (Ace), 
hit rate (TPR), false alarm rate (expense) (FPR), 
specificity (Sp), positive predictive value (PPV) 
and negative predictive value (NPV) of 



III. RESULT AND DISCUSSION 

The performance of the proposed algorithm is 
evaluated for a total of 150 tumors (90 as a 
training set and 60 for validation and 
verification). Out of these 150 tumors, there 
were 71 benign tumors and 79 malignant tumors. 
Features (normalized mean radial distance (N rm ), 
Number of crossing points (N c )) for 
classification is extracted through our proposed 
algorithm for all 150 breast tumor contours. 
Decision threshold defined in Eqn.(13) (for FI) 
and Eqn.(14) (for F2 ) is calculated for 90 
contours of training set (with known feature 
value and class) through stastical based 
optimum threshold technique and is found to be 
0.8 ,03 for Fi and F2 respectively. 
Fig. 4 shows the feature values (or defined in 
Eqn.(3) & Eqn(ll)) for tumors classified as 
benign and malignant. It can be observed that 
there exists a clear cut demarcation between the 
feature values of two classes of tumors at 
decision threshold values. Table 1 and Table 2 
shows the performance of the proposed 
algorithm (based on FI and F2 respectively) for 
all 150 tumors in terms of classification 
accuracy defined in Eqn. (15). It can be 
observed from Table 1 that the proposed method 
results into misclassification of only 5 out of 71 
benign tumors and only 6 out of 79 malignant 
tumors, giving the overall accuracy of 92.66%. 
Similarly based on second feature (number of 
crossing points (F2)), the proposed algorithm 
performance is summarized in Table 2 It can be 
observed from Table 2 that the proposed 
algorithm misclassified only 5 out of 71 benign 
tumors and 6 out of 79 malignant tumors , 
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giving the overall accuracy of 94.667%. Table 3 
and Fig. 5 compare the performance of the 
proposed method with other existing methods in 
terms of overall classification accuracy defined 
in Eqn.(15) . It is evident from the table and 
figure that the accuracy of the proposed method 
is approximately 10-19% better than its 
contemporary methods based on Fi and F 2 . 



s e , s p , PPV, NPV defined in Eqn (15)-(19). The 
ROC parameters for the proposed method and 
other similar methods are summarized in Table 5. 
It can be observed from the table that the 
proposed method outperforms than other 
methods consistently for all ROC parameters, 
which show the ability of our algorithm to 
efficiently classify the breast malignancy. 
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TABLE 1. Breast tumor classification Based on Normalized Mean 
radial distance (Fl) 



(a) 



an 



7S f ■ ■»■ ■ ■■■■_■■■■■■■■■■■■■■ B« r KB ■ ■ ■■■ ■ ■ ■«■*"■" "l" 

;: Breast tumor classmcation[Nc) 

Malignant 
.Benign 



E 



eg 

> 



rz 



20 
15 



'I 10 

Q. 




i , t w t * , _ . ******** r 

rriTiriTir m n n n n n n n n n n n (TiTiTiT n iTiTiTiriTiTi n n n n n n nn n n n nT 

1 7 131925313743495561677379 

Numberof tumors 



Input 
Data's 



Benign 



Malignant 
Total 



Case 



71 



Based on Normalized mean radial 
distance 



Classified as % 

Benign Malignant Accuracy 

~66 T05 92.95% 



79 
150 



06 

^12 



73 
~78~ 



98.73% 
92.66% 



(b) 

Figure 4. Classified tumors with its Feature (Variance) Value (a) Based 
on Fl; (b) Based on F2 



TABLE 2 Breast tumor classification Based on Number 
points (N c ) 


of crossing 


Input 
Data's 


Case 


Based on 
distance 


Normalized 


nean radial 


Classified 


as 


% 
Accuracy 


Benign 


Malignant 


Benign 


71 


66 


05 


92.95 


Malignant 


79 


03 


76 


96.20 


Total 


150 


69 


81 


94.667 



TABLE 3. Performance Comparison of proposed method with the other 
existing methods 



The performance of the proposed method is also 
evaluated in terms of ROC parameters such as A c , 
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S.No 


Name Author' (s) and their 
Proposed Method 


Percentage 
of Accuracy 


1 


Oliver Menut et.al/ Parabolic 
Modelling Method[3] 


76% 


2 


N.R. Mudigonda et.al /Iterative 
Boundary Segmentation 
algorithms[ll] 


81% 


3 


W.H.lee et al /Wavelet Based 
Breast tumor Classification 
Method[6] 


84% 


4 


Proposed 
Method 


Based on Fl 


92.667% 


Based on F2 


94.667% 



Conclusion 
In this paper we proposed a simple novel and 
automatic algorithm for breast tumor 
classification. The classification is based on two 
features which can be used to measure the 
roughness of the outlier of the breast tumor. The 
simulation results showed that the proposed 
efficiently classifies a wide range of tumors and 
outperforms other methods in terms of ROC 
parameters. Another salient feature of this 
method is that it is simpler compared to other 
method as only centroid and radius of the circle 
are need to be calculated. 
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Figure 5. Comparison with other existing System (in terms of Accuracy) 
Table 5. Comparison of ROC parameters with other existing system 



Measured 

ROC 

parameters 


Wavelet 
Based 
Method 


Local 
texture 
Method 


Our Proposed Method 


Based on 

Fi 


Based 
on F2 


A c 


0.84 


0.9375 


0.92667 


0.94667 


Se 


0.933 


0.95 


0.9241 


0.9873 


s P 


0.795 


0.9231 


0.9296 


0.9014 


PPV 


0.714 


0.9344 


0.9359 


0.9176 


NPV 


0.956 


0.9412 


0.9167 


0.9846 
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